RAW SEQTJRNfTR T JST1NG 



The Biotechnology Systems Branch of the Scientific and Technical 
Information Center <STTC) no errors detected- 
Application Serial Number: /c/^/c, (qQQ 

Source: ffuf . ~ 

Date Processed by STIC: /Qj2-)o(o 



ENTERED 



RAW SEQUENCE LISTING DATE: 12/12/2006 

PATENT APPLICATION: US/10/510 , 677 TIME: 14:39:30 



Input Set : F:\API-01-20-US-SeqList.ST25.txt 
Output Set: N:\CRF4\12122006\J510677.raw 



3 <110> APPLICANT: Sanofi Pasteur, Ltd. 

4 Therion Biologies, Inc. 

6 <120> TITLE OF INVENTION: Modified CEA Nucleic Acid and Expression Vectors 
8 <130> FILE REFERENCE: API-01-20-US 
10 <140> CURRENT APPLICATION NUMBER: 10/510,677 
C--> 11 <141> CURRENT FILING DATE: 2004-10-06 

13 <150> PRIOR APPLICATION NUMBER: US 60/370,972 

14 <151> PRIOR FILING DATE: 2002-04-09 
16 <160> NUMBER OF SEQ ID NOS : 32 

18 <170> SOFTWARE: Patentln version 3.3 

20 <210> SEQ ID NO: 1 

21 <211> LENGTH: 3564 

22 <212> TYPE: DNA 

2 3 <213> ORGANISM: Homo sapiens 
26 <220> FEATURE: 

2 7 <221> NAME/KEY: misc_f eature 

28 <222> LOCATION: ( 1663 )..( 1663 ) / 

29 <223> OTHER INFORMATION: n is a, c, g, or t 

31 <400> SEQUENCE: 1 

32 agcaggaccg gggcctgtgt cgctatgggt tcccccgccg ccccggaggg agegctggge 60 
34 tacgtccgcg agttcactcg ccactcctcc gacgtgctgg gcaacctcaa cgagctgcgc 12 0 
36 ctgcgcggga tcctcactga cgtcacgctg ctggttggcg ggcaacccct cagagcacac 180 
38 aaggcagttc tcatcgcctg cagtggcttc ttctattcaa ttttccgggg ccgtgcggga 240 
40 gtcggggtgg acgtgctctc tctgcccggg ggtcccgaag cgagaggctt cgcccctcta 300 
42 ttggacttca tgtacacttc gcgcctgcgc ctctctccag ccactgcacc agcagtccta 360 
44 gcggccgcca ectatttgea gatggagcac gtggtccagg catgccaccg cttcatccag 42 0 
46 gecagctatg aacctctggg catctccctg cgccccctgg aagcagaacc cccaacaccc 480 
48 ccaacggccc ctccaccagg tagtcccagg cgctccgaag gacacccaga cccacctact 540 
50 gaatctcgaa getgeagtea aggccccccc agtccagcca gccctgaccc caaggcctgc 600 
52 aactggaaaa agtacaagta categtgeta aactctcagg cctcccaagc agggagectg 660 
54 gteggggaga gaagttctgg tcaaccttgc ccccaagcca ggctccccag tggagacgag 72 0 
56 gcctccagca gcagcagcag cagcagcagc agcagcagtg aagaaggacc cattcctggt 780 
58 ccccagagca ggctctctcc aactgctgcc actgtgcagt tcaaatgtgg ggctccagcc 840 
60 agtaccccct acctcctcac atcccaggct caagacacct ctggatcacc etctgaaegg 900 
62 gctcgtccac tacegggagt gaatttttca getgecagaa ctgtgaggct gtggcagggt 960 
64 getcateggg ggctggactc cttggttcct ggggacgaag acaaacccta taagtgtcag 1020 
66 ctgtgccggt cttcgttccg ctacaagggc aaccttgcca gtcaccgtac agtgcacaca 1080 
68 ggggaaaagc cttaccactg ctcaatctgc ggagcccgtt ttaaccggcc agcaaacctg 1140 
70 aaaaegcaca gccgcatcca ttegggagag aagcegtata agtgtgagac gtgeggcteg 12 00 
72 cgctttgtac aggtggcaca tetgegggeg cacgtgctga tccacaccgg ggagaagece 1260 
74 tacccttgcc ctacctgcgg aacccgcttc cgccacctgc agaccctcaa gagecaegtt 132 0 
76 cgcatccaca ccggagagaa gccttaccac tgcgacccct gtggcctgca tttccggcac 1380 
78 aagagtcaac tgeggctgea tctgcgccag aaacaeggag ctgctaccaa caccaaagtg 1440 
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80 cactaccaca ttctcggggg gccctagctg agcgcaggcc caggccccac ttgcttcctg 1500 

82 cgggtgggaa agctgcaggc ccaggccttg cttccctatc aggcttgggc ataggggtgt 1560 

84 gccaggccac tttggtatca gaaattgcca ccctcttaat ttctcactgg ggagagcagg 1620 

W--> 86 ggtggcagat cctggctaga tctgcctctg ttttgctggt canaccctct tccccacaag 1680 

88 ccagattgtt tctgaggaga gagctagcta ggggctggga aaggggagag attggagtcc 1740 

90 tggtctccct aagggaatag ccctccacct gtggccccca ttgcattcag tttatctgta 1800 

92 aaatataatt tattgaggcc tttgggtggc accggggcct tcattcgatt gcatttccca 1860 

94 ctcccctctt ccacaagtgt gattaaaagt gaccagaaac acagaaggtg agatcacagc 192 0 

96 tctgctggca gagattacta gcccttggct ctctcgtttg gcttgggtat tttatattat 1980 

98 ttctgtcata acttttatct ttagaattgt tctttctcct gtttgtttgc ttgttagttt 2040 

100 gtttaaaatg gaaaaagggg ttctctgtgt tctgcccctg taattctagg tctggaacct 2100 

102 ttatttgttc tagggcagct ctgggaacat gcgggattgt ggaattgggt caggaaccct 2160 

104 ctctggtatt ctggatgttg taggttctct agcagtctag aaatggatac agacatttct 222 0 

106 ctgttcttca agggtgatag gaaccattat gttgagccca aaatggaagt aataataaat 2280 

108 gcctcctgga ggctgtgggt gtgggggatt ctgtatctgg attccgtatc actccaactg 2340 

110 gaggctgtgg gtgtggggga ttctgtatct ggattccgta tcactccaag tggaggctgg 2400 

112 caggtttttc tgcaagatgg tccagaatct aaaatgtccc attaatctgg tcacttgggt 2460 

114 ttggctctgc tgtatccatc tatagtggta gagacccacc agggctcaag tggagtccat 2520 

116 catcctccca cgggggcctg ttcttagtac tgagttgatc gctccatggg ggagagatca 2580 

118 gacattcctt atcagagatg atgtgacctt ttctgactct gcccagtctc tatgaatgtt 2640 

120 atggcctagg gaagaatcat gaaactcttt agcttgatta gatggtaaac agtgttaacc 2700 

122 catcctttac tacagaggca tatgggtttg aatgttacct ggggttctct ctattgagtt 2760 

124 gagccccttc ttcctttagt gggttttgga catcttctgg caagtgtcca gatgccagaa 2 82 0 

126 ccttcttttc ctctagaagg gatggtgctt ggtaacctta ccttttaaaa gctgggtctg 2880 

128 tgacctggtc ttcccatccc tgcattcctg tctggaacca gtgaatgcat tagaaccttc 2940 

130 cataggaaaa gaaaaggggc tgagttccat tctgggtttg ctgtagtttg gttgggatta 3000 

132 ttgttggcat tacagatgta aaagattgac tagcccatag gccaaaggcc tgttctagtt 3060 

134 gaccaagttt caagtaggat taagaggttg gttgaggggt gcagtttctg gtgtaggcca 312 0 

136 ggtaggtaga aagtgaggaa cagggttgcc tcttggctgg gtggagtctc tgaaatgtta 3180 

138 gaagaagcgc tgaagccttg attgatagtt ctgccccttg ttgccctggg gcttatctga 3240 

140 ttatgggacg agggtagaaa gtaagaagca cttttgaatt tgtggggtag aacttcaaca 3300 

142 ataagtcagt tctagtggct gtcgcctggg gactagtgag aaagctactc ttctccctct 3360 

144 tccctctttc tccccatggc cccactgcag aattaaagaa ggaagaaggg aaggcggagg 342 0 

146 agtctataag aaggaatcat gatttctatt tagcagattg gatgggcagg tggagaatgc 3480 

148 ctgggggtag aaatgttaga tcttgcaaca tcagatcctt ggaataaaga agcctctctg 3540 

150 cgcaaaaaaa aaaaaaaaaa aaaa 3564 

153 <210> SEQ ID NO: 2 

154 <211> LENGTH: 1440 

155 <212> TYPE: DNA 

156 <213> ORGANISM: Homo sapiens 

158 <400> SEQUENCE: 2 

159 atgggttccc ccgccgcccc ggagggagcg ctgggctacg tccgcgagtt cactcgccac 60 
161 tcctccgacg tgctgggcaa cctcaacgag ctgcgcctgc gcgggatcct cactgacgtc 120 
163 acgctgctgg ttggcgggca acccctcaga gcacacaagg cagttctcat cgcctgcagt 180 
165 ggcttcttct attcaatttt ccggggccgt gcgggagtcg gggtggacgt gctctctctg 240 
167 cccgggggtc ccgaagcgag aggcttcgcc cctctattgg acttcatgta cacttcgcgc 300 
169 ctgcgcctct ctccagccac tgcaccagca gtcctagcgg ccgccaccta tttgcagatg 360 
171 gagcacgtgg tccaggcatg ccaccgcttc atccaggcca gctatgaacc tctgggcatc 42 0 
173 tccctgcgcc ccctggaagc agaaccccca acacccccaa cggcccctcc accaggtagt 480 
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PATENT APPLICATION : US/10/510,677 



DATE: 12/12/2006 
TIME: 14:39:30 



Input Set : F:\API-01-20-US-SeqList.ST25.txt 
Output Set: N:\CRF4\12122006\J510677.raw 



175 cccaggcgct ccgaaggaca cccagaccca cctactgaat ctcgaagctg cagtcaaggc 
177 ccccccagtc cagccagccc tgaccccaag gcctgcaact ggaaaaagta caagtacatc 
179 gtgctaaact ctcaggcctc ccaagcaggg agcctggtcg gggagagaag ttctggtcaa 
181 ccttgccccc aagccaggct ccccagtgga gacgaggcct ccagcagcag cagcagcagc 
183 agcagcagca gtgaagaagg acccattcct ggtccccaga gcaggctctc tccaactgct 
185 gccactgtgc agttcaaatg tggggctcca gccagtaccc cctacctcct cacatcccag 
187 gctcaagaca cctctggatc accctctgaa cgggctcgtc cactaccggg aagtgaattt 
189 ttcagctgcc agaactgtga ggctgtggca gggtgctcat cggggctgga ctccttggtt 
191 cctggggacg aagacaaacc ctataagtgt cagctgtgcc ggtcttcgtt ccgctacaag 
193 ggcaaccttg ccagtcatcg tacagtgcac acaggggaaa agccttacca ctgctcaatc 
195 tgcggagccc gttttaaccg gccagcaaac ctgaaaacgc acagccgcat ccattcggga 
197 gagaagccgt ataagtgtga gacgtgcggc tcgcgctttg tacaggtggc acatctgcgg 
199 gcgcacgtgc tgatccacac cggggagaag ccctaccctt gccctacctg cggaacccgc 
2 01 ttccgccacc tgcagaccct caagagccac gttcgcatcc acaccggaga gaagccttac 
2 03 cactgcgacc cctgtggcct gcatttccgg cacaagagtc aactgcggct gcatctgcgc 
205 cagaaacacg gagctgctac caacaccaaa gtgcactacc acattctcgg ggggccctag 

208 <210> SEQ ID NO: 3 

209 <211> LENGTH: 65 

210 <212> TYPE: DNA 

211 <213> ORGANISM: Homo sapiens 

213 <400> SEQUENCE: 3 

214 atacccggaa ctccctaagc cttctattag ctccaataat agtaagcctg tcgaagacaa 
216 agatg 

219 <210> SEQ ID NO: 4 

220 <211> LENGTH: 70 

221 <212> TYPE: DNA 

222 <213> ORGANISM: Homo sapiens 

224 <400> SEQUENCE: 4 

225 gcctgtgtcc cctagactcc aactcagcaa cggaaataga actctgaccc tgtttaacgt 
227 gaccaggaac 

230 <210> SEQ ID NO: 5 

231 <211> LENGTH: 70 

232 <212> TYPE: DNA 

233 <213> ORGANISM: Homo sapiens 

235 <400> SEQUENCE: 5 

236 acgtgcttta cggacccgat gctcctacaa tcagccctct aaacacaagc tatagatcag 
238 gggaaaatct 

241 <210> SEQ ID NO: 6 

242 <211> LENGTH: 70 

243 <212> TYPE: DNA 

244 <213> ORGANISM: Homo sapiens 

246 <400> SEQUENCE: 6 

247 acgttaaaca gggtcagagt tctatttccg ttgctgagtt ggagtctagg ggacacaggc 
24 9 agggactggt 

252 <210> SEQ ID NO: 7 

253 <211> LENGTH: 70 

254 <212> TYPE: DNA 

255 <213> ORGANISM: Homo sapiens 
257 <400> SEQUENCE: 7 
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PATENT APPLICATION: US/10/510 , 677 
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TIME: 14:39:30 



Input Set : F:\API-01-20-US-SeqList.ST25.txt 
Output Set: N:\CRF4\12122006\J510677.raw 



258 ctgatctata gcttgtgttt agagggctga ttgtaggagc atcgggtccg taaagcacgt 
260 tgagaatcac 

263 <210> SEQ ID NO: 8 

264 <211> LENGTH: 63 

265 <212> TYPE: DNA 

266 <213> ORGANISM: Homo sapiens 

268 <400> SEQUENCE: 8 

269 gatccactat tgttcacggt aatattggga atgaacagtt cctgggtgga ctgttggaaa 
271 gtg 

274 <210> SEQ ID NO: 9 

275 <211> LENGTH: 70 

276 <212> TYPE: DNA 

277 <213> ORGANISM: Homo sapiens 

279 <400> SEQUENCE: 9 

280 gacacagcaa gctacaaatg cgaaacccaa aatccagtca gcgccaggag gtctgattca 
282 gtgattctca 

285 <210> SEQ ID NO: 10 

286 <211> LENGTH: 70 

287 <212> TYPE: DNA 

2 88 <213> ORGANISM: Homo sapiens 
2 90 <400> SEQUENCE: 10 

2 91 tgaatcagac ctcctggcgc tgactggatt ttgggtttcg catttgtagc ttgctgtgtc 
293 gttcctggtc 

296 <210> SEQ ID NO: 11 

297 <211> LENGTH: 79 

298 <212> TYPE: DNA 

299 <213> ORGANISM: Homo sapiens 

301 <400> SEQUENCE: 11 

302 gatcctacac gtgccaagct cacaatagcg acaccggact caaccgcaca accgtgacga 
304 cgattaccgt gtatgccga 

307 <210> SEQ ID NO: 12 

3 08 <211> LENGTH: 70 
3 09 <212> TYPE: DNA 

310 <213> ORGANISM: Homo sapiens 

312 <400> SEQUENCE: 12 

313 catcctcaac tgggttagaa ttgttactag ttatgaatgg ttttggtggc tcggcataca 
315 cggtaatcgt 

318 <210> SEQ ID NO: 13 

319 <211> LENGTH: 80 
32 0 <212> TYPE: DNA 

321 <213> ORGANISM: Homo sapiens 

323 <400> SEQUENCE: 13 

324 ttctaaccca gttgaggatg aggacgcagt tgcattaact tgtgagccag agattcaaaa 
326 taccacttat ttatggtggg 

329 <210> SEQ ID NO: 14 

330 <211> LENGTH: 80 

331 <212> TYPE: DNA 

332 <213> ORGANISM: Homo sapiens 
334 <400> SEQUENCE: 14 
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PATENT APPLICATION: US/10/510 , 677 



DATE: 12/12/2006 
TIME: 14:39:30 



Input Set : F:\API-01-20-US-SeqList.ST25.txt 
Output Set: N:\CRF4\12122006\J510677.raw 



335 gtctaatgat aaccgcacat tgacactcct gtccgttact cgcaatgatg taggacctta 
337 tgagtgtggc attcagaatg 

340 <210> SEQ ID NO: 15 

341 <211> LENGTH: 80 

342 <212> TYPE: DNA 

343 <213> ORGANISM: Homo sapiens 

345 <400> SEQUENCE: 15 

346 tttgtatggc ccagacgacc caactatatc tccatcatac acctactacc gtcccggcgt 
348 gaacttgagc ctttcttgcc 

351 <210> SEQ ID NO: 16 

352 <211> LENGTH: 80 

353 <212> TYPE: DNA 

354 <213> ORGANISM: Homo sapiens 

356 <400> SEQUENCE: 16 

357 tgatggaaac attcagcagc atactcaaga gttatttata agcaacataa ctgagaagaa 
359 cagcggactc tatacttgcc 

362 <210> SEQ ID NO: 17 

363 <211> LENGTH: 80 

364 <212> TYPE: DNA 

365 <213> ORGANISM: Homo sapiens 

367 <400> SEQUENCE: 17 

368 taaaacaata actgtttccg cggagctgcc caagccctcc atctccagca acaactccaa 
370 acccgtggag gacaaggatg 

373 <210> SEQ ID NO: 18 

374 <211> LENGTH: 80 

375 <212> TYPE: DNA 

376 <213> ORGANISM: Homo sapiens 
3 78 <400> SEQUENCE: 18 

379 atgtgcggtt atcattagac aactgcaagc gtgggctaac cggcaaactt tggttattga 
381 cccaccataa ataagtggta 

384 <210> SEQ ID NO: 19 

385 <211> LENGTH: 80 

386 <212> TYPE: DNA 

387 <213> ORGANISM: Homo sapiens 

389 <400> SEQUENCE: 19 

390 ggtcgtctgg gccatacaaa acattaagga taacagggtc ggagtgatca acggataatt 
392 cattctgaat gccacactca 

395 <210> SEQ ID NO: 20 

396 <211> LENGTH: 80 

397 <212> TYPE: DNA 

398 <213> ORGANISM: Homo sapiens 

400 <400> SEQUENCE: 20 

401 gctgctgaat gtttccatca atcagccagg agtactgtgc aggggggttg gatgctgcat 
403 ggcaagaaag gctcaagttc 

406 <210> SEQ ID NO: 21 

407 <211> LENGTH: 80 

408 <212> TYPE: DNA 

409 <213> ORGANISM: Homo sapiens 
411 <400> SEQUENCE: 21 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 12/12/2006 

PATENT APPLICATION: US/10/510 , 677 TIME: 14:39:31 

Input Set : F:\API-01-20-US-SeqList.ST25.txt 
Output Set: N:\CRF4\12122006\J510677.raw 

Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of^ each sequence which presents at least one n or Xaa. 

Seq#:l; N Pos . 1663 
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VERIFICATION SUMMARY DATE: 12/12/2006 

PATENT APPLICATION: US/10/510 , 677 TIME: 14:39:31 

Input Set : F:\API-01-20-US-SeqList.ST25.txt 
Output Set: N:\CRF4\12122006\J510677.raw 

L:ll M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:86 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 after pos.:1620 
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